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1.0  INTRODUCTION 


Any  transmission  or  reception  at  visible  or  infrared  wavelengths  may  be  adversely  af¬ 
fected  by  cloudiness.  For  more  than  20  years  this  has  been  the  primary  factor  driving 
researchers  toward  accurate  characterization  of  cloud-free  regions. 

Early  investigators  (McCabe,  1965;  Lund,  1965;  Lund  and  ShanHin,  1972, 1973;  Rapp  et 
al.,  1973)  used  whole-sky  photographs  and/or  cloud  climatologies  to  estimate  probabilities 
of  cloud-free  lines-of-sight  from  a  specific  point  at  the  surface  to  a  specific  point  in  the 
atmosphere.  No  estimates  of  the  extent  of  the  cloud-free  area  surrounding  the  lines-of-sight 
were  made.  Grantham  et  al.  (1979)  addressed  this  deficiency.  They  detailed  methods  used 
to  determine  cloud-free  fields-of-view.  These  methods  were  generalized  to  treat  the  case  of 
an  observer  above  the  surface  looking  down  as  well  as  the  previously  treated  case  of  the 
ground-based  observer. 

Interest  in  cloud-free  regions  as  viewed  from  atop  the  atmosphere  has  grown  as  the 
importance  of  remote  sensing  by  satellites  has  increased.  But,  satellite  instruments  do  not 
record  cloud-free  regions  in  the  same  way  as  the  human  observer.  Their  radiometers  scan 
one  line  at  a  time  to  produce  an  image  of  whatever  lies  below.  Malick  et  al.  (1979),  imi¬ 
tating  this  scanning  method,  modeled  probabilities  of  cloudy  and  clear  intervals  of  specific 
lengths  along  straight-line  paths.  They  accomplished  this  by  thresholding  satellite  imagery 
to  separate  cloudy  and  dev  regions  and  then  constructing  histograms  of  the  doudy  and 
dev  interval  lengths.  The  resulting  probability  densities  for  both  doudy  and  dev  lengths 
were  distributed  approximatdy  exponentially  with  short  intervals  being  much  more  likely 
than  long  intervals.  Because  the  pvameter  used  in  the  exponential  probability  density 
function  is  derived  from  the  mean  of  the  random  variable  in  question,  Malick  et  al.  (1979) 
suggested  use  of  the  arithmetic  means  of  doudy  and  clev  interval  lengths  to  estimate  those 
pvameters. 


Recent  research  has  built  upon  the  work  of  Malick  et  al.  (1979).  Goldstein  and  Janota 
(1985)  provided  a  rigorous  theoretical  treatment  of  the  exponential  modeling  of  cloudy  and 
clear  intervals  along  finite  paths.  Snow  et  al.  (1985b)  and  Snow  (1985)  used  photographic 
imagery  gathered  during  space  shuttle  orbiter  missions  to  model  interval  length  number 
densities  with  gamma,  lognormal,  and  exponential  functions.  Snow  and  Will  and  (1986), 
again  using  space  shuttle  imagery,  best  fit  interval  length  probabilities  with  the  exponential 
probability  density  function.  They  also  illustrated  that  cloud  fraction  values  generated  from 
very  small  portions  of  the  original  image  exhibited  little  variablity,  suggesting  that  image 
subsections  could  estimate  the  character  of  the  entire  image. 

As  mentioned,  previous  works  were  primarily  concerned  with  the  probability  that  a 
cloudy  or  clear  interval  equalled  a  specific  length  [Pr(X=x)j.  A  function  used  to  describe 
such  an  equality  is  called  a  probability  density  function  (PDF).  A  topic  mentioned  only 
in  passing  by  Malick  et  al.  (1979)  was  the  probability  that  a  cloudy  or  clear  interval  was 
greater  than  at  equal  to  a  specific  length  [Pr(X>x)j.  A  function  used  to  describe  this  is 
called,  for  want  of  a  better  term,  one  minus  the  cumulative  distribution  function  (1-CDF) 
and  is  simply  the  integral  of  the  PDF  from  x  to  infinity.  These  functions  are  simply  alternate 
methods  used  to  characterize  cloudy  and  clear  areas. 

Characterizations  of  these  types  may  be  quite  useful.  An  airborne  or  spacebome  observer 
might  wish  to  view  or  transmit  a  signal  to  some  point  at  the  surface.  If  the  observer  needed 
a  break  in  the  clouds  of  at  least  one  kilometer,  he/she  could  calculate  the  clear  interval 
1-CDF  and  use  it  to  predict  a  probability  of  success.  The  1-CDF  for  clear  intervals  might 
also  prove  useful  to  navigational,  earth  resource,  and  cartographic  satellite  mission  planners. 

Atmospheric  modelers  might  use  such  characterizations  to  improve  parameterizations 
of  cloudiness.  Using  a  cloudy  interval  PDF  would  provide  greater  detail  than  the  simple 
assignment  of  a  particular  fractional  cloud  cover.  If  the  modeler  were  not  interested  in 
cloudiness,  but  rather  in  the  distribution  of  moisture  or  vertical  motion,  cloudy  intervals 
could  be  used  as  proxy  variables  to  determine  these  distributions. 

The  purpose  of  this  study,  then,  is  to  accurately  model  the  PDFs  and  1-CDFs  of  both 
cloudy  and  clear  interval  lengths  within  broken  cloud  fields.  Three  models  are  tested;  their 
goodness- of- fit  is  evaluated  with  metric  and  congruent  statistical  methods  not  previously 
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used  in  these  types  of  analyses.  Image  fractions  as  sma.1I  as  1/32  are  evaluated  to  determine 
how  well  they  characterize  the  whole  image. 

The  imagery  used  consists  of  16  digitized  cloud-field  photographs  taken  during  four 
space  shuttle  missions.  This  nearly  doubles  the  number  of  cases  considered  to  date  in  this 
line  of  research.  This  thesis  constitutes  the  first  formal  report  of  its  kind  using  very  high 
resolution,  photographic  cloud  imagery  collected  aboard  the  space  shuttle. 


2.0  DATA  DESCRIPTION 


2.1  DATA  SOURCE 

Space  shuttle  photographic  images  were  chosen  as  data  for  this  study  because  of  their 
very  high  resolution  and  to  illustrate  their  utility.  Other  data  sources  are  available  (e.g., 
Landsat)  but  are  not  commonly  used  to  record  cloud  scenes. 

The  16  digitized  photographic  images  were  collected  during  four  space  shuttle  missions 
between  September  1984  and  October  1985.  Both  hand-held  35  millimeter  and  large  format 
cameras  were  used.  The  hand-held  camera  photographs  were  collected  as  part  of  the  Cloud 
Logic  to  Optimize  Utilization  of  Defense  Systems  (CLOUDS)  experiment.  This  experiment 
is  sponsored  by  the  United  States  Air  Force  and  directed  by  the  Air  Force  Geophysics 
Laboratory  (AFGL).  The  large  format  camera  photographs  were  collected  as  part  of  the 
Orbital  Camera  Payload  System  (OCPS)  project.  This  project  is  co-sponsored  by  six  United 
States  Government  agencies  and  directed  by  the  Large  Format  Interagency  Working  Group. 
Magnetic  tapes  of  these  digitized  images  were  provided  by  AFGL. 

2.2  IMAGE  SELECTION 

The  purpose  of  this  study  was  to  model  cloudy  and  clear  interval  probability  distri¬ 
butions  in  areas  of  broken  cloudiness.  For  this  reason,  images  with  vast  areas  of  overcast 
or  completely  clear  skies  were  not  used.  The  images  were  recorded  from  very  dose  to  the 
zenith  to  eliminate  the  apparent  shortening  of  dear  intervals  and  the  distortion  of  doudy 
intervals  resulting  from  non-zero  nadir  angles.  Althought  many  images  were  collected  dur¬ 
ing  the  CLOUDS  and  OCPS  projects,  only  16  of  them  met  both  of  these  requirements. 
Fortunately,  11  of  the  16  images  chosen  came  from  latitudes  higher  than  any  previously 
considered.  The  predominant  doud  types  in  these  images  were  cumulus  and  stratocumulus. 


i 
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2.3  IMAGE  COLLECTION  AND  CHARACTERISTICS 


Five  of  the  sixteen  photographic  images  were  taken  with  the  hand-held  camera  on  three 
separate  shuttle  missions.  They  were  taken  through  the  overhead  flight  deck  windows.  The 
remaining  11  photographic  images  were  taken  with  the  large  format  camera  mounted  in  the 
shuttle  cargo  bay.  Table  2.1  provides  some  particulars  of  the  16  photographs  and  Figure 
2.1  shows  the  locations  from  which  they  were  taken.  Both  the  shuttle  bay  and  overhead 
windows  are  directed  earthward  during  normal  shuttle  operations. 

A  thorough  treatment  of  image  collection  systems  is  provided  by  Snow  and  Tomlinson 
(1987).  Information  of  primary  interest  to  this  study  is  shown  in  Table  2.2. 

These  image  collection  systems  are  non-standard  (i.e.,  photographic)  and  record  cloud 
scenes  differently  than  satellite  radiometers.  Photographic  emulsion  charateristic  curves 
relate  the  logarithm  of  film  exposure  to  film  density.  Film  exposure  and  scene  reflectance 
are  linearly  related  (Jensen,  1968)  and  film  density  is  simply  the  logarithm  of  opacity.  So 
the  film  characteristic  curve,  in  the  end,  depicts  the  relationship  between  scene  reflectance 
and  film  opacity.  On  the  other  hand,  satellite  radiometer  calibration  curves  relate  scene 
reflectance  to  reflectance  count  values. 

Both  of  the  films  in  Table  2.2  exhibit  linear  characteristic  curves  across  the  range  of 
exposures  used  to  collect  imagery  for  this  study.  Film  contrast  is  higher  than  in  the  original 
scene  because  the  slopes  of  these  curves  exceed  one.  (The  effect  of  this  contrast  enhancement 
on  digitization  is  discussed  in  Section  3.1.) 


TABLE  2.1  IMAGE  SPECIFICATIONS 


IMAGE 

NUMBER 

LOCATION 

LATITUDE 

(degrees 

north) 

DATE/TIME** 

(YYMMDD/ 

HHHH)(GMT) 

ORBITAL 

ALTITUDE 

(km) 

1* 

Dominican 

Republic 

19 

840902/1530 

300 

2 

Bering  Sea 

57 

841006/2100 

270 

3 

Bering  Sea 

57 

841006/2100 

270 

4 

Bristol  Bay 

57 

841006/2100 

270 

5 

Bristol  Bay 

57 

841006/2100 

270 

6 

Bristol  Bay 

57 

841006/2100 

270 

7 

Bristol  Bay 

57 

841006/2100 

270 

8 

Bristol  Bay 

57 

841006/2100 

270 

9 

Bristol  Bay 

57 

841006/2100 

270 

10 

Poland 

53 

841007/1030 

270 

11 

Poland 

53 

841007/1030 

270 

12 

Poland 

53 

841007/1030 

270 

13* 

Baja 

California 

Sur 

25 

850125/1700 

300 

14* 

Florida 

28 

850125/1800 

300 

15* 

Florida 

27 

851006/1400 

450 

16* 

Florida 

28 

851006/1700 

450 

*  These  images  were  taken  with  the  hand-held  camera. 
**  All  times  are  approximate. 


(Of  those  images  taken  in  the  same  place  and  at 
nearly  the  same  time,  no  two  are  closer  together 
than  75  kilometers.) 
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3.0  DATA  TREATMENT 


3.1  IMAGE  DIGITIZATION 


To  accomplish  quantitative  image  analysis,  each  of  the  16  images  was  digitized.  Images 
1,  13,  and  14  were  digitized  by  a  private  firm  under  contract  with  AFGL.  The  remaining 
13  images  were  digitized  at  the  Air  Force  Wright  Aeronautics  Laboratory  (AFWAL).  The 
original  negative  images  were  digitized  using  a  linear  relationship  between  negative  opacity 
and  digital  count  values.  These  count  values  were  then  expanded  or  contracted  to  match  the 
256  count  range  of  the  image  display  system.  Reversal  of  the  digital  count  values  provided 
a  positive  digital  image.  Throughout  the  entire  imaging  process,  linearity  between  the 
original  scene  and  its  digital  representation  was  preserved  although  contrast  was  changed. 
This  contrast  manipulation  did  not  affect  brightness  relationships  within  the  image.  An 
area  appearing  twice  as  bright  as  another  in  the  original  scene  would  still  appear  twice  as 
bright  in  the  final  digitized  image. 

The  extremely  high  resolution  negative  images  were  not  digitized  at  full  resolution  due  to 
equipment  limitations  and  prohibitive  expense.  Table  3.1  lists  the  resolutions  of  the  digitized 
images.  Resolution  of  the  digitized  hand-held  imagery  was  determined  by  counting  the 
number  of  pixels  constituting  a  surface  feature  of  known  dimension.  In  contrast,  resolution 
of  the  digitized  large  format  imagery  was  determined  by  knowing,  in  advance,  the  dimensions 
of  the  surface  area  covered  by  the  imaging  system.  These  dimensions  were  dictated  by  image 
collection  system  and  orbital  characteristics. 

3.2  IMAGE  CROPPING 

The  digitized  images  were  displayed  on  the  CIRA  (C  'operative  Institute  for  Research 
in  the  Atmosphere)  COMTAL  image  processing  system  and  reviewed  for  applicability  to 
this  study.  The  three  images  digitized  under  contract  exhibited  some  noisy  values  near  the 


TABLE  3.1  DIGITIZED  IMAGE  SIZES  AND  RESOLUTIONS 


IMAGE 

NUMBER 

ORIGINAL 

IMAGE 

SIZE 

(pixels) 

CROPPED 

IMAGE 

SIZE 

(pixels) 

IMAGE 

RESOLUTION 

(m/pixel) 

AREA  OF 
CROPPED 
IMAGE 
(square  km) 

1 

512  x  390 

157  x  241 

235 

2089.548 

2 

2048  x  2048 

512  x  345 

99 

1731.249 

3 

2048  x  2048 

512  x  512 

99 

2569.273 

4 

2048  x  2048 

512  x  512 

49 

629.408 

5 

2048  x  2048 

512  x  512 

49 

629.408 

6 

2048  x  2048 

310  x  240 

49 

178.634 

7 

2048  x  2048 

512  x  512 

49 

629.408 

8 

2048  x  2048 

512  x  512 

49 

629.408 

9 

2048  x  2048 

512  x  512 

99 

2569.273 

10 

2048  x  2048 

512  x  512 

99 

2569.273 

11 

2048  x  2048 

512  x  512 

99 

2569.273 

12 

2048  x  2048 

512x512 

99 

2569.273 

13 

270  x  256 

241  x  240 

300 

5205.600 

14 

270  x  256 

241  x  240 

225 

2928.150 

15 

2048  x  2048 

260  x  365 

65 

400.953 

15 

2048  x  2048 

463  x  373 

58 

580.959 

image  edges.  The  13  images  digitized  by  AFWAL,  each  2048  x  2048  pixels,  were  too  large 
to  display  on  the  image  processing  system.  After  eliminating  noisy,  extraneous,  and  large 
overcast  and  dear  areas,  16  images  of  no  greater  size  than  512  x  512  pixels  remained.  Table 
3.1  provides  the  sizes  of  the  original  and  cropped  images. 

3.3  IMAGE  THRESHOLDING 

After  digitizing  and  cropping  the  images,  choice  of  a  threshold  was  necessary  to  dis¬ 
tinguish  doudy  and  dear  areas.  Histograms  of  the  first  three  images  examined  exhibited 
strong  bimodality.  The  peak  at  lower  reflectance  values  in  each  histogram  corresponded 
to  the  background  while  the  peak  at  higher  reflectance  values  corresponded  to  areas  well 
within  douds.  A  pronounced  valley  separated  these  two  peaks.  Table  3.2  shows  at  what 
reflectance  values  the  two  marimn  and  the  dividing  minimum  occurred  for  images  1,  13, 
and  14.  Figure  3.1  illustrates  these  features  for  image  13. 


TABLE  3.2  HISTOGRAM  MAXIMA  AND  MINIMA 


IMAGE 

NUMBER 

REFLECTANCE 
AT  FIRST 
MAXIMUM 

REFLECTANCE 
AT  DIVIDING 
MINIMUM 

REFLECTANCE 
AT  SECOND 
MAXIMUM 

1 

74 

128 

186 

13 

9C 

147 

187 

14 

131 

159 

200 

Knowing  that  the  first  peaks  represented  clear  areas  and  the  second  peaks  represented 
douds,  thresholding  the  images  at  the  dividing  minima  was  considered.  In  these  cases, 
however,  that  tactic  underestimated  the  actual  doud  cover.  The  dividing  minima  values 
(and  those  just  below  them)  corresponded  to  pixels  within  doud  edges.  This  is  plausible 
when  considering  that  doud  edges  normally  occupy  less  image  area  than  either  main  doud 
bodies  or  dear  areas. 
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Having  determined  representative  values  for  cloud  edge  and  background  reflectances,  a 
thresholding  technique  discussed  by  Billingsley  (1982)  and  Wielicki  and  Welch  (1986)  was 
chosen.  The  reflectance  threshold,  Rt,  is  given  by 

Rt  =  ^4^,  (3-1) 

A 

where  Ro  is  the  average  reflectance  of  a  completely  cloud  filled  pixel  within  the  cloud  edge 
and  R,  is  the  dear-sky  background  reflectance. 

This  thresholding  method  worked  very  well  for  the  three  images  listed  in  Table  3.2; 
however,  most  of  the  13  remaining  images  did  not  exhibit  bimodal  reflectance  histograms. 
In  these,  there  were  only  pronounced  maxima  corresponding  to  the  background  reflectances, 
with  higher  count  values  nearly  uniformly  distributed.  To  solve  the  problem  of  locating  the 
second  peak  in  these  histograms,  smaller  sections  of  each  image  that  exhibited  slightly 
higher  doudy  fractions  were  examined.  Figure  3.2  shows  histograms  of  image  9  before  and 
after  inspection  of  a  smaller  doudy  section. 

As  shown  in  Figure  3.2b,  these  image  sections  did  yidd  secondary  histogram  peaks 
corresponding  to  doudy  regions,  but  no  sharp  minima  corresponding  to  doud  edges  was 
present.  This  suggested  determination  of  the  Billingsley  threshold  in  relation  to  the  two 
histogram  peaks  rather  than  a  first  peak  and  dividing  minimum. 

Four  sections  of  each  bimodal  image  (i.e.,  a  total  of  12  subsections)  were  examined 
and  the  Billingsley  threshold  determined.  The  average  fractional  distance  at  which  the 
thresholds  occurred  between  the  two  maxima  of  each  histogram  was  .254.  The  Billingsley 
threshold  expression  was  then  altered  to  read 

Rt  =  R,  +  .254 {Ro  -  Rt).  (3.2) 

Using  this  thresholding  method,  all  16  images  were  treated  with  consistency. 

Histograms  of  no  less  than  two  image  subsections  were  used  to  determine  each  image 
threshold.  Subsection  thresholds  were  averaged  and  applied  to  the  whole.  Pixels  with 
reflectance  at  or  above  these  thresholds  were  considered  entirely  cloudy  and  shaded  white. 
Pixels  with  reflectance  values  below  these  thresholds  were  considered  entirely  clear  and 
shaded  black.  All  subsequent  image  manipulations  were  completed  using  these  binary  (i.e., 
black  and  white)  images. 
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Appendix  A  contains  copies  of  all  16  images,  the  subsections  of  those  images  treated  in 
this  study,  and  the  image  subsection  after  threshold  application. 

3.4  COUNTING  ALGORITHM 

The  primary  purpose  of  previous  counting  algorithms  (Malick  et  al.,  1979;  Snow  et  al., 
1985b)  was  to  determine  the  probability  that  clear  intervals  of  any  given  length  existed  in 
an  image.  (For  simplicity,  only  the  clear  case  is  discussed  at  this  point.  Of  course,  any 
discussion  applies  equally  to  the  cloudy  interval  lengths.)  These  algorithms  were  written  to 
start  at  the  upper  left  corner  of  an  image  and  read  from  left  to  right  until  reaching  the  end 
of  the  line.  The  second  line  was  then  read  from  left  to  right  and  the  program  progressed 
in  this  fashion  until  the  lower  right  corner  of  the  image  was  reached.  This  treatment  of  the 
image  was  meant  to  simulate  the  scanning  pattern  used  by  satellite  radiometers. 

Counting  all  the  way,  the  program  tallied  the  number  of  clear  intervals  of  each  length 
encounterd.  This  information  constituted  the  number  density  of  the  dear  intervals.  The 
number  density  values  were  then  divided  by  the  total  number  of  dear  intervals  to  yidd  the 
probability  density.  This  represented  Pr(X=x)  for  dear  interval  lengths. 

A  problem  occurs  when  intervals  at  the  image  edges  are  not  carefully  treated.  These 
artifidally  shortened  segments  can  bias  any  number  or  probability  densities  computed.  In 
the  described  algorithms,  these  truncated  intervals  were  tallieu  along  with  all  the  others. 

A  primary  aim  in  the  design  of  this  study’s  counting  algorithm  was  to  eliminate  the 
inappropriate  treatment  of  these  artificially  shortened  intervals.  This  was  accomplished  by 
ignoring  the  first  dear  interval  of  each  image  line  and  any  cloudy  interval  before  it,  and 
ignoring  the  last  doudy  interval  in  each  line  and  any  dear  interval  following  it.  This  ensured 
that  the  image  used  for  statistical  analysis  consisted  of  an  equal  number  of  cloudy  and 
dear  intervals,  none  of  which  were  artifidally  shortened.  Having  eliminated  “undesirable” 
intervals  the  algorithm  was  used  to  generate  the  desired  doud  fidd  statistics. 

To  this  point,  with  the  exception  of  truncated  interval  treatment,  this  study’s  counting 
algorithm  dosdy  followed  those  used  previously.  But,  in  the  present  study  the  probability 
that  a  dear  interval  exceeded  any  given  value  [Pr(X>x)j  was  also  of  interest.  The  program 


used  here  just  as  easily  tallied  the  numbers  of  clear  intervals  exceeding  each  possible  value 
in  the  distribtion  and  then  divided  once  more  by  the  total  number  of  clear  intervals. 

The  primary  output  of  the  counting  algorithm,  then,  was  four  files  (two  for  the  cloudy 
case  and  two  for  the  clear  case).  The  first  and  third  files  contained  the  probabilities  that  a 
randomly  chosen  cloudy  or  clear  interval  would  equal  any  particular  length  [Pr(X=x)].  The 
second  and  fourth  files  contained  the  probabilities  that  a  randomly  chosen  cloudy  or  clear 
interval  would  equal  or  exceed  any  chosen  length  [Pr(X>x)].  Recall  that  functions  used  to 
model  these  probabilities  are  the  PDF  and  1-CDF. 

The  algorithm  also  computed  the  mean  cloudy  and  clear  interval  lengths.  To  accomplish 
this,  the  total  number  of  cloudy  or  clear  pixels  was  divided  by  the  number  of  cloudy  or  clear 
intervals. 

3.5  MODEL  CHOICES 

Prospective  models  for  the  PDF  and  1-CDF  were  chosen  after  viewing  plots  (on  linearly 
scaled  paper)  of  the  actual  probability  distributions.  Figure  3.3  shows  these  plots  for  cloudy 
intervals  of  image  1. 

Plots  for  the  other  15  images  wtere  quite  similar  in  shape  with  short  interval  length 
probabilities  considerably  higher  than  those  of  long  intervals.  Contributions  to  these  high 
probabilities  were  made  by  both  small  clouds  and  small  appendages  extending  from  larger 
clouds. 

After  viewing  all  the  plots,  model  choices  were  narrowed  to  exponential,  hyperbolic,  and 
logarithmic  functions.  These  functions  were  particularly  convenient  to  work  with  because 
they  are  easily  linearized,  simplifying  subsequent  regressions.  Table  3.3  shows  the  functions 
and  their  linear  transforms.  In  Table  3.3,  x  represents  the  chosen  cloudy  interval  length  and 
Y  represents  the  modeled  Pr(X=x)  or  Pr(X>x),  depending  on  whether  the  PDF  or  1-CDF 
is  being  considered. 

Only  (3  needed  to  be  estimated  before  model  probabilities  were  compared  with  the 
observed  values.  To  estimate  /?,  actual  values  of  Y  and  X  (or  functions  of  these)  were 
compared  by  linear  regression  using  the  linear  transforms  of  the  1-CDFs  shown  in  Table  3.3. 
The  regression  algorithm  minimized  the  sum  of  the  absolute  deviations  about  the  regression 


Figure  3.3  PROBABILITY  DISTRIBUTIONS  FOR  IMAGE  1 


TABLE  3.3  PROSPECTIVE  MODEL  FUNCTIONS 


MODEL* 

PDF 

PDF 

(linear  transform) 

1-CDF 

1-CDF 

(linear  transform) 

exp 

Y=-/3exp(/3x) 

ln(Y)=ln(-/3)+/3x 

Y=exp(/3x) 

ln(Y)=0+/3x 

hyp 

Y=-/?exp(/3-l) 

ln(Y)=ln(-/3)+(/3-l)ln(x) 

Y=x^ 

In  (Y)=0+/?ln(x) 

log 

x 

i 

II 

>* 

Y=0+(-/3/x) 

Y=/31n(x) 

Y=0+/31n(x) 

exp  —  exponential,  hyp  =  hyperbolic,  log  =  logarithmic 


line  to  determine  /3  rather  than  minimizing  the  sum  of  the  squared  deviations.  (NOTE: 
Because  the  /3s  were  determined  using  transforms  of  the  original  equations,  there  was  no 
guarantee  that  errors  would  still  be  minimal  when  respective  (3  values  were  used  in  the 
original  equations.) 

3.6  GOODNESS-OF-FIT 

Values  of  /3  determined  by  regression  were  then  placed  into  the  original  model  equations 
for  PDF  and  1-CDF  and  compared  to  the  observed  probabilities.  This  comparison  was 
made  losing  a  goodness-of-fit  statistic  based  on  least  absolute  deviation  (See  Appendix  B.). 
The  goodness-of-fit  statistic,  G,  was  computed  by 

G  =  (l/n)X>-n  (3-3) 

i=i 

where  n  is  the  number  of  observations  (i.e.,  length  categories),  Y  is  the  observed  probability 
value,  and  Y  is  the  model  estimate  of  the  observed  value. 

Table  3.4  shows  the  number  of  observations  used  for  each  model-to-observation  compar¬ 
ison.  As  an  illustrative  example,  consider  image  1.  Cloudy  intervals  range  in  length  from 
1  to  71  pixels  and  clear  intervals  from  1  to  49  pixels.  These  sire  the  number  of  cases  for 
which  values  of  the  1-CDF  can  be  computed.  But  when  the  PDF  is  considered  for  each 
case,  this  number  decreases.  Even  though  there  are  cloudy  intervals  from  as  short  as  1 
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TABLE  3.4  NUMBER  OF  OBSERVATIONS  USED  IN  COMPARISONS 


IMAGE 

NUMBER 

Pr(X=x) 

OR 

PDF 

FOR 

CLOUDY 

CASE 

Pr(X>x) 

OR 

1-CDF 

FOR 

CLOUDY 

CASE 

Pr(X=X) 

OR 

PDF 

FOR 

CLEAR 

CASE 

Pr(X>X) 

OR 

1-CDF 

FOR 

CLEAR 

CASE 

1 

38 

71 

38 

49 

2 

110 

137 

133 

218 

3 

72 

104 

241 

418 

4 

134 

173 

167 

413 

5 

163 

269 

121 

141 

6 

54 

77 

63 

114 

7 

161 

215 

181 

287 

8 

127 

216 

200 

275 

9 

159 

193 

112 

159 

10 

91 

132 

223 

418 

11 

94 

115 

229 

352 

12 

83 

90 

150 

178 

13 

45 

50 

42 

58 

14 

104 

179 

41 

89 

15 

91 

116 

76 

105 

16 

48 

66 

134 

254 

TOTAL* 

1574 

2203 

2151 

3528 

wn”  is  equal  to  these  values  in  computation  of  the 
G  statistic  for  the  16  images  as  a  group.  Other 
values  of  “n”  shown  here  are  used  in  Appendix  C 
to  compute  G  values  for  individual  images. 
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pixel  to  as  long  as  71  pixels,  not  every  length  category  between  these  is  represented  by  a 
cloudy  length.  In  the  case  of  image  1  there  are  33  cloudy  lengths  (71  minus  38)  that  are 
not  represented.  For  the  clear  case  there  are  only  11  lengths  (49  minus  38)  missing.  These 
are  simply  the  unobserved  lengths  between  shortest  and  longest  observed  values.  In  image 
1  there  are  no  clear  intervals  29,  32,  35,  38,  40,  41,  44,  45,  46,  47,  or  48  pixels  long.  Errors 
for  these  non-existent  lengths  were  not  computed  when  determining  the  goodness-of-fit  of 
the  PDF  models. 


4.0  RESULTS 


4.1  MODEL  PERFORMANCE 

Values  of  G  for  model  treatments  of  individual  images  are  presented  in  Appendix  C 
while  those  for  the  16  images  as  a  group  are  presented  in  Table  4.1. 

The  exponential  model  exhibited  a  smaller  mean  error  than  the  other  two  models  in 
every  condition/case  category.  This  agreed  with  the  findings  of  Snow  and  Willand  (1986), 
who  found  that  the  exponential  distribution  outperformed  the  lognormal  distribution  and 
two  varieties  of  the  gamma  distribution.  (Then  results  were  based  only  on  fits  of  PDFs, 
however.)  Further  examination  of  the  exponential  model  was  merited  on  the  basis  of  these 
small  errors. 

4.2  EXPONENTIAL  MODEL  INVESTIGATION 

Malick  et  al.  (1979)  were  the  first  to  suggest  that  the  mean  interval  length  (cloudy  or 
clear)  be  used  to  estimate  0  when  using  the  exponential  distribution  functions.  Snow  and 
Willand  (1986)  were  the  first  to  use  this  concept  on  actual  data.  They  estimated  0  for  the 
cloudy  case  by  multiplying  the  cloud  cover  fraction  by  the  recurrence  interval  (simply  the 
sum  of  the  mean  cloudy  and  clear  interval  lengths),  realizing  that  the  result  was  identical 
to  the  reciprocal  of  the  mean  cloudy  interval  length.  For  the  dear  case,  they  used  the  clear 
fraction  times  the  recurrence  length. 

The  counting  algorithm  used  in  this  study  computed  both  the  mean  cloudy  and  mean 
clear  interval  lengths  for  each  image.  The  reciprocals  of  these  are  the  actual  estimators  of  0. 
The  resulting  values  were  substituted  into  the  original  exponential  equations  to  determine 
whether  or  not  mean  errors  could  be  reduced. 

Table  4.2  is  a  comparison  of  exponential  model  results  using  0s  estimated  by  both 
regression  and  cloud-field  statistics  (mean  cloudy  and  dear  interval  lengths). 


TABLE  4.1  MEAN  ERRORS  OF  MODELED  PROBABILITIES 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

6.1494 

1574 

.00391** 

OR 

PDF 

hyp 

24.9089 

1574 

.01583 

log 

16.3421 

1574 

.01038 

Pr(X>x) 

exp 

74.0809 

2203 

.03363** 

OR 

1-CDF 

hyp 

145.0984 

2203 

.06584 

log 

223.3028 

2203 

.10136 

CLEAR 

Pr(X=x) 

exp 

8.6670 

2151 

.00403** 

OR 

PDF 

hyp 

23.2122 

2151 

.01079 

log 

16.3199 

2151 

.00759 

Pr(X>x) 

exp 

165.3093 

3528 

.04686** 

OR 

1-CDF 

hyp 

209.6766 

3528 

.05943 

log 

313.2807 

3528 

.08880 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  4.2  ERRORS  USING  0  FROM  REGRESSION  AND  FROM  CLOUD  FIELD 


CONDITION 

CASE 

METHOD* 

TOTAL 

NUMBER  OF 

G** 

ERROR*** 

ERROR 

OBSERVATIONS 

(mean 

REDUCTION 

error) 

(percent) 

CLOUDY 

Pr(X=x) 

regre 

6.1494 

1574 

.00391 

OR 

23 

PDF 

cloud 

4.7423 

1574 

.00301 

Pr(X>x) 

regre 

74.0809 

2203 

.03363 

OR 

1-CDF 

cloud 

42.1936 

2203 

.01915 

43 

CLEAR 

Pr(X=x) 

regre 

8.6670 

2151 

.00403 

OR 

20 

PDF 

cloud 

6.9140 

2151 

.00321 

Pr(X>x) 

regre 

165.3093 

3528 

.04686 

OR 

1-CDF 

cloud 

100.3047 

3528 

.02843 

39 

*  regre  =  determined  from  regression 

cloud  =  determined  from  cloud-field  statistics 

•*  Multiply  these  values  by  100  to  find  the  percent  error. 

***  All  error  reduction  resulted  from  use  of  0  determined  using 
cloud-field  statistics  (mean  cloudy  and  clear  interval  lengths). 


It  is  at  first  surprising  that  cloud-field  statistical  estimates  of  0  perform  better  them 
those  estimated  by  regression.  The  apparent  contradiction  is  rectified  by  recalling  that  the 
regression  minimized  error  for  the  transformed  equations,  not  the  original  equations.  (See 
NOTE  at  the  end  of  Section  3.5.) 

4.3  IMAGE  FRACTION  STATISTICS 

How  small  can  an  image  fraction  be  while  still  accurately  reflecting  the  probability 
distributions  of  the  original  image?  Snow  and  Willand  (1986)  provided  part  of  the  answer. 
They  treated  images  by  looking  at  short  strings  of  pixels,  one  after  another,  and  determining 
the  variance  of  both  cloudy  fraction  and  recurrence  length.  They  systematically  increased 
the  pixel  string  lengths  until  the  variance  stabilized  at  a  low  value.  For  the  few  images  they 
considered,  the  variance  reached  and  maintained  a  minimum  when  the  image  was  sampled 
in  successive  strings  of  about  250  or  more  pixels.  This  showed  there  was  little  variance  in 
cloud  cover  and  recurrence  length  from  one  250  pixel  string  to  the  next.  This  variability, 
even  though  low,  was  still  based  on  examination  of  the  entire  image  and  did  not  consider 
the  result  of  examining  only  one  (perhaps  the  first)  250  pixel  string. 

This  study  considered  smaller  and  smaller  fractions  of  each  image  in  preference  to  pixel 
strings  of  fixed  length.  This  was  done  to  determine  just  what  fraction  of  an  image,  rather 
than  absolute  number  of  pixels,  needs  to  be  sampled  to  accurately  represent  the  entire  image 
in  terms  of  its  mean  cloudy  or  dear  interval  length.  Initially,  the  first  half  of  an  image 
was  considered;  then,  the  first  quarter.  This  halving  of  each  successive  fractional  portion 
continued  until  the  first  1/32  of  each  image  was  treated.  Each  of  these  fractions  was  taken 
from  the  beginning  of  the  image  (as  a  satellite  might  view  it  when  passing  overhead)  so 
that  the  resulting  model  errors  would  reflect  the  inaccuracies  of  a  spatial  forecast.  Values 
of  0  were  determined  for  each  fractional  image  and  then  used  in  the  exponential  model  to 
estimate  the  probability  distributions  of  the  original,  whole  image.  Mean  errors  resulting 
from  use  of  the  fractional  0  values  were  then  computed. 

Table  4.3  presents  the  mean  errors  for  fractional  images.  As  expected,  errors  increased 
as  smaller  and  smaller  image  fractions  were  used  to  model  the  entire  image.  The  subsequent 
halving  of  the  image  fractions  was  stopped  when  two  successive  image  fractions  produced 


TABLE  4.3  MEAN  ERRORS  ASSOCIATED  WITH  FRACTIONAL  IMAGES 


CONDITION 

CASE 

IMAGE  FRACTION 
CONSIDERED 

G* 

(mean  error) 

CLOUDY 

Pr(X=x) 

1/1 

.00301 

OR 

1/2 

.00325 

PDF 

1/4 

.00345 

1/8 

.00371 

1/16 

.00375 

1/32 

.00400 

Pr(X>x) 

1/1 

.01915 

OR 

1/2 

.02492 

1-CDF 

1/4 

.03576 

1/8 

.04993 

1/16 

.05678 

1/32 

.06274 

CLEAR  Pr(X=x) 

1/1 

.00321 

OR 

PDF 

1/2 

1/4 

1/8 

1/16 

1/32 

.00389 

.00358 

.00336 

.00359 

.00408 

Pr(X>x) 

1/1 

.02843 

OR 

1/2 

.03668 

1-CDF 

1/4 

.05223 

1/8 

.03684 

1/16 

.05128 

1/32 

.05914 

*  Multiply  these  values  by  100  to  find  the  percent  error. 


5.0  SUMMARY  AND  CONCLUSIONS 


5.1  SUMMARY 

This  study  used  16  very  high  resolution,  digitzed  photographic  images  collected  during 
4  space  shuttle  orbiter  missions.  They  were  used  to  determine  a  realistic  model  of  the  PDF 
and  1-CDF  of  cloudy  and  clear  interval  lengths  within  broken  cloud-fields.  Three  models 
were  tested:  exponential,  hyperbolic,  and  logarithmic. 

Image  fractions  as  small  as  1  /32  were  used  to  estimate  the  character  of  the  entire  cloud- 
field.  These  spatial  forecasts  and  the  interval  probability  models  were  tested  using  statistical 
methods  based  on  least  absolute  deviations. 

5.2  CONCLUSIONS 

The  major  conclusions  of  this  study  are  as  follows: 

1.  The  use  of  the  dividing  minimum  reflectance  value,  between  the  two  bimodal  his¬ 
togram  maxima  of  a  visible  image,  as  a  threshold  between  cloudy  and  clear  regions  is 
questionable.  Pixels  of  this  and  slightly  lower  reflectance  values  fell  within  cloud  edges 
causing  many  cloudy  pixels  to  be  classified  as  dear.  A  thresholding  technique  loosely  based 
on  that  of  Billingsley  (1982)  was  proposed.  It  placed  the  threshold  at  approximately  one 
quarter  of  the  distance  between  the  low  and  high  value  peaks  of  the  histogram.  (The  tech¬ 
nique  worked  very  well  in  this  study;  however,  it  was  developed  using  only  a  small  number 
of  very  high  resolution  visible  images.) 

2.  Exponential  models  can  accurately  represent  observed  probabilities  of  doudy  and 
dear  interval  lengths  within  cumuli  form  doud  fidds  (see  Table  4.1).  Very  good  estimates 
of  the  exponential  distribution  parameters  are  provided  simply  by  the  inverses  of  the  mean 
doudy  and  mean  dear  interval  lengths  (see  Table  4.2). 


3.  Image  fractions  as  small  as  the  first  1/32  can  estimate,  with  reasonable  accuracy, 
the  PDF  and  1-CDF  of  the  cloudy  and  clear  interval  lengths  of  the  entire  image  (see  Table 
4.3).  In  other  words,  a  form  of  spatial  forecast  can  be  made  of  what  probabilities  will  occur 
in  an  area  larger  than  that  actually  sampled. 

Conclusions  two  and  three  agree  with  and  extend  the  findings  of  Snow  and  Willand 
(1986).  It  is  encouraging,  in  light  of  dissimilar  statistical  and  image  edge  treatments,  that 
this  and  previous  works  agree  on  the  utility  of  the  exponential  model  in  representation  of 
cloudy  and  clear  interval  length  probabilities.  However,  the  underlying  physical  principle 
dictating  the  distribution  of  these  interval  lengths  remains  unexplained. 

If  interval  lengths  are  considered  a  measure  of  the  entropy  or  degree  of  order  of  the  cloud 
system,  an  argument  made  by  Levine  (1983)  may  be  used.  He  proposed  that  entropy  be 
defined  as  a  function  of  the  probability  of  a  state.  He  proved  that  this  function  of  probability 
must  take  the  form 

5  =  Jfe[in(Pr)]  +  a,  (5.1) 

where  5  is  entropy,  k  and  a  are  constants,  and  Pr  is  the  probability  of  the  state.  It  is  quite 
interesting  to  find  that  substitution  of  Y  and  x  (from  section  3.5)  for  Pr  and  5  of  (5.1) 
yields  an  equation  of  the  same  form  as  the  exponential  PDF  and  1-CDF. 

5.3  POSSIBLE  APPLICATIONS  AND  FUTURE  WORK 

5.3.1  Applications 

Although  many  applications  of  this  study’s  results  may  exist,  those  in  the  atmospheric 
sciences  are  of  primary  interest. 

Atmospheric  boundary  layer  characteristics  control  the  growth  of  cumuliform  clouds. 
From  one  location  to  another  and  from  one  time  to  another  these  characteristics  change, 
with  no  two  boundary  layer  air  masses  entirely  identical.  If  clouds  are  present,  the  mean 
cloudy  and  clear  interval  lengths  also  change.  It  may  be  possible  to  equate  different  values 
of  /3  (or  its  reciprocal)  with  specific  air  mass  types  or  with  single  air  masses  during  different 
phases  of  boundary  layer  development.  This  would  constitute  a  simple  parameterization  of 
boundary  layer  characteristics  that  might  prove  useful  in  atmospheric  models  of  sufficiently 
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high  resolution.  The  cloudy  and  dear  interval  lengths  could  be  treated  as  proxies  for  the 
atmospheric  variables  assodated  with  them.  This  could  allow  a  modder  to  dictate  the 
distribution  of  moisture,  vertical  motion,  etc.  within  the  model. 

Modelers  of  the  atmospheric  boundary  layer  and  convective  processes  might  use  the 
exponential  PDF  as  a  necessary  but  insuffident  condition  for  doud-fidd  realism.  Cloud 
scenes  generated  by  their  models  could  be  subjected  to  the  counting  algorithm  proposed  in 
this  paper.  Resulting  mean  cloudy  and  dear  lengths  could  be  used  to  generate  the  PDFs 
that  the  modd  doud-fidd  should  exhibit.  The  more  dosdy  the  modd  probabilities  match 
the  exponential  PDFs,  the  better  the  modd  representation  of  reality  (i.e.,  reality  as  the 
exponential  PDFs  represents  it).  While  this  constitutes  a  comparison  of  two  models,  one 
theoretical  and  one  empirical,  the  exponential  modd  has  thus  far  proven  quite  accurate  in 
its  representations  of  cloud  scenes. 

5.3.2  Future  Work 

The  space  shuttle  orbiter  is  capable  of  collecting  imagery  at  extremdy  high  resolutions 
(see  Appendix  D).  These  data  could  be  used  to  examine  atmospheric  phenomena  with  a 
combination  of  global  coverage  and  detail  not  previously  possible.  (Satellites  may  have 
greater  global  coverage  and  aerial  photographs  may  have  greater  spatial  resolution,  but  the 
space  shuttle  provides  an  unparallelled  combination  of  the  two.) 

Cumulus  and  stratocumulus  were  the  predominant  doud  types  in  this  study.  Other 
broken  doud  types  should  be  examined  to  determine  whether  or  not  their  interval  length 
probability  distributions  can  be  described  by  exponential  functions.  It  may  also  be  useful 
to  describe  how  these  distributions  change  as  azimuthal  scanning  angle  is  varied  for  single 
images. 

Currently,  accurate  description  of  doud  size  distributions  is  possible,  but  two-dimen¬ 
sional  representation  of  spaces  separating  those  douds  is  difficult.  Consequently,  anyone 
interested  in  doud-fidd  representations  may  know  the  sizes  of  douds  but  not  know  how- 
far  apart  to  place  them.  This  study  has  shown  that  both  doudy  and  dear  spaces  can  be 
accurately  characterized  in  one  dimension.  If  one-  and  two-dimensional  representations  can 
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be  directly  related,  perhaps  more  accurate  two-dimensional  cloud-field  characterizations 


could  be  accomplished. 


w 
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APPENDIX  A 


IMAGERY: 

This  section  provides  the  reader  with  prints  of  the  imagery  used  in  this  study.  The 
prints  were  made  using  a  Talaris  800  Laserprinter.  The  input  to  the  printer  was  provided 
by  the  CIRA  COMTAL  image  processing  system.  Having  undergone  several  manipulations, 
the  images  no  longer  display  their  orignal  resolution.  The  resolution  of  the  printed  images 
is  four  times  more  coarse  than  when  they  were  displayed  on  the  COMTAL  imaging  system. 

Each  print  is  divided  into  four  quadrants.  The  contents  of  each  quadrant  is  explained 
below. 

UPPER  LEFT  QUADRANT:  This  image  is  shown  as  it  was  displayed  on  the  COMTAL 
imaging  system.  For  the  2048  x  2048  pixel  images,  this  is  the  512  x  512  section  selected 
from  it. 

UPPER  RIGHT  QUADRANT:  This  image  is  the  cropped  image  and  is  a  subsection  of 
the  image  in  the  upper  left  quadrant.  If  the  image  was  not  cropped,  it  appears  exactly  as 
it  did  in  the  upper  left  quadrant. 

LOWER  RIGHT  QUADRANT:  This  image  exactly  matches  the  one  above  it  except 
that  the  cloudy/clear  threshold  has  been  applied.  Clouds  appear  white  and  clear  areas 
appear  black. 

LOWER  LEFT  QUADRANT:  Along  with  pixel  resolution,  here  are  some  image  statis¬ 
tics  not  presented  elsewhere  in  the  paper. 


resolution: 

235  meters /pixel 

cropped  image  size: 
36.9  x  56.6  km 

mean  cloudy  length: 
1.23  km 

mean  clear  length: 
1.08  km 
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Figure  A.3  TREATMENTS  AND  STATISTICS  OF  IMAGE  3 
(See  Appendix  A  text  for  figure  explanation.) 


resolution: 

49  meters/pixel 


cropped  image  size 
25.1  x  25.1  km 

mean  cloudy  length 
0.87  km 

mean  clear  length: 
1.05  km 


resolution: 

49  meters/pixel 

cropped  image  size: 
25.1  x  25.1  km 

mean  cloudy  length: 
0.51  km 

mui  clear  length: 
0.54  km 


Figure  A.6  TREATMENTS  AND  STATISTICS  OF  IMAGE  6 
(See  Appendix  A  text  for  figure  explanation.) 
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resolution: 

99  meters/pixel 


cropped  image  size: 
50.7  x  50.7  km 

mean  cloudy  length: 
2.02  km 

mean  clear  length: 
1.56  km 


Figure  A.9  TREATMENTS  AND  STATISTICS  OF  IMAGE  9 
(See  Appendix  A  text  for  figure  explanation.) 


Figure  A.ll  TREATMENTS  AND  STATISTICS  OF  IMAGE  11 
(See  Appendix  A  text  for  figure  explanation.) 
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resolution: 

58  meters/pixel 

cropped  image  size: 
26.9  x  21.6  km 

mean  cloudy  length: 
0.43  km 

mean  clear  length: 
0.83  km 


APPENDIX  B 


STATISTICAL  TREATMENT: 

The  intent  of  this  research  was  to  successfully  model  cloudy  and  clear  interval  length 
PDFs  and  1-CDFs.  Before  prospective  models  could  be  treated,  a  statistical  analysis  method 
had  to  be  chosen.  The  method  would  be  used  to  perform  regressions  and  to  determine  the 
goodness-of-fit  of  the  models  to  the  data. 

It  is  desirable  that  any  statistical  analysis  method  be  both  metric  and  congruent.  As 
pointed  out  by  Mielke  (1986),  a  method  is  metric  if  (i)  Dij  >  0  and  D^i  =  0,  (ii)  Ditj  =  Dji 
(symmetry),  and  (iii)  Dij  +  Djj,  >  D^k  (triangle  inequality).  The  first  conditions  state  that 
the  distance  between  two  points  must  be  greater  than  or  equal  to  zero  and  that  the  distance 
from  a  point  to  itself  is  zero.  The  second  condition  states  that  the  distance  measured 
between  two  points  is  the  same  regardless  of  which  point  is  considered  first.  The  third 
condition  states  that  the  sum  of  any  two  sides  of  a  triangle  must  be  greater  than  or  equal 
to  the  measure  of  the  third.  These  conditions,  if  met,  aid  in  the  definition  of  a  Euclidean 
space. 

The  domain  of  observations  between  which  comparisons  are  made  (the  data  space)  is 
ususally  perceived  as  a  Euclidean  space  and  is,  therefore,  metric.  It  is  ideal  to  attain 
congruence  between  the  data  space  and  analysis  space.  This  requires  that  the  analysis 
space  be  metric  as  well.  This  often  goes  unrecognized  as  evidenced  by  the  frequent  use  of 
least  squares  analysis  methods.  The  analysis  space  of  these  methods  is  incongruent  in  that 
it  does  not  match  any  Euclidean  data  space  and  non-metric  because  it  does  not  satisfy  the 
triangle  inequality.  Least  squares  methods  also  require  that  the  variance  be  homogeneous 
and  normally  distributed  about  the  mean.  For  all  these  reasons,  least  squares  methods  were 
abandoned 

There  are  statistical  analysis  methods  that  satisfy  the  metric  and  congruent  require¬ 
ments.  They  are  based  on  least  absolute  deviations  rather  than  least  squared  deviations. 
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Treatment  of  the  data  space  using  these  methods  is  in  keeping  with  our  intuitive  Euclidean 
perceptions.  Least  absolute  methods  were  used  in  Chapter  3. 


APPENDIX  C 


MEAN  ERRORS  FOR  INDIVIDUAL  IMAGES: 

These  tables  present  G  values  (mean  errors)  for  every  condition/case  combination  for 
every  image.  The  exponential  model  is  outperformed  in  only  3  of  64  instances.  In  these 
instances  the  hyperbolic  model  accurately  treats  the  CLEAR/  1-CDF  CONDITION/CASE. 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.3694 

104 

.0036** 

OR 

PDF 

hyp 

2.0435 

104 

.0196 

log 

1.0015 

104 

.0096 

Pr(X>x) 

exp 

2.0453 

179 

.0114** 

OR 

1-CDF 

hyp 

3.2768 

179 

.0183 

log 

5.1769 

179 

.0289 

CLEAR 

Pr(X=x) 

exp 

.2897 

41 

.0071** 

OR 

PDF 

hyp 

log 

1.8508 

1.0051 

41 

41 

.0451 

.0245 

Pr(X>x) 

exp 

1.0330 

89 

.0116** 

OR 

1-CDF 

hyp 

2.5703 

89 

.0289 

log 

4.5366 

89 

.0510 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.2  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  2 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.3745 

110 

.0034** 

OR 

PDF 

hyp 

1.3321 

110 

.0121 

log 

1.0534 

110 

.0096 

Pr(X>x) 

exp 

5.8057 

137 

.0424** 

OR 

1-CDF 

hyp 

9.4241 

137 

.0688 

log 

16.7178 

137 

.1220 

CLEAR 

Pr(X=x) 

exp 

.4491 

133 

.0034** 

OR 

PDF 

hyp 

1.3543 

133 

.0102 

log 

1.0351 

133 

.0078 

Pr(X>x) 

exp 

11.3177 

218 

.0519** 

OR 

1-CDF 

hyp 

13.2375 

218 

.0607 

log 

22.0372 

218 

.1011 

*  Multiply  these  values  by  100  to  find  the  percent  error. 
**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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TABLE  C.3  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  3 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.2865 

72 

.0040** 

OR 

PDF 

hyp 

1.6511 

72 

.0229 

log 

1.0167 

72 

.0141 

Pr(X>x) 

exp 

1.5171 

104 

.0146** 

OR 

1-CDF 

hyp 

8.5522 

104 

.0822 

log 

12.5533 

104 

.1207 

CLEAR 

Pr(X=x) 

exp 

6500 

241 

.0027** 

OR 

PDF 

hyp 

log 

1.1510 

1.0471 

241 

241 

.0048 

.0043 

Pr(X>x) 

exp 

26.3525 

418 

.0630** 

OR 

1-CDF 

hyp 

27.3945 

418 

.0655 

log 

43.0012 

418 

.1029 

*  Multiply  these  values  by  100  to  find  the  percent  error. 
**  These  models  performed  best  in  their  respective 
CONDITION /CASE  combinations. 


TABLE  C.4  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  4 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.5232 

134 

.0039** 

OR 

PDF 

hyp 

1.1601 

134 

.0089 

log 

1.0401 

134 

.0078 

Pr(X>x) 

exp 

7.6218 

173 

.0441** 

OR 

1-CDF 

hyp 

8.6531 

173 

.0500 

log 

16.1890 

173 

.0936 

CLEAR 

Pr(X=x) 

exp 

.8893 

167 

.0053** 

OR 

PDF 

hyp 

log 

1.4772 

1.0023 

167 

167 

.0088 

.0060 

Pr(X>x) 

exp 

26.1830 

413 

.0634** 

OR 

1-CDF 

hyp 

16.0812 

413 

.0389 

log 

20.7773 

413 

.0503 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.5  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  5 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.4661 

163 

.0029** 

OR 

PDF 

hyp 

1.5283 

163 

.0094 

log 

1.0091 

163 

.0062 

Pr(X>x) 

exp 

8.1467 

269 

.0303** 

OR 

1-CDF 

hyp 

17.4624 

269 

.0649 

log 

22.9170 

269 

.0852 

CLEAR 

Pr(X=x) 

exp 

.5476 

121 

.0045** 

OR 

PDF 

hyp 

log 

1.0653 

1.0433 

121 

121 

.0088 

.0086 

Pr(X>x) 

exp 

5.6682 

141 

.0402** 

OR 

1-CDF 

hyp 

8.9198 

141 

.0633 

log 

15.6746 

141 

.1112 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.6  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  6 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.4193 

54 

.0078** 

OR 

* 

PDF 

hyp 

1.3693 

54 

.0254 

log 

1.0286 

54 

.0190 

Pr(X>x) 

exp 

1.9194 

77 

.0249** 

OR 

1-CDF 

hyp 

5.7590 

77 

.0748 

log 

9.8395 

77 

.1278 

CLEAR 

Pr(X=x) 

exp 

.6537 

63 

.0104** 

OR 

PDF 

hyp 

1.3181 

63 

.0209 

log 

1.0157 

63 

.0161 

Pr(X>x) 

exp 

5.9228 

114 

.0520** 

OR 

1-CDF 

hyp 

6.2155 

114 

.0545 

log 

10.3621 

114 

.0909 

*  Multiply  these  values  by  100  to  find  the  percent  error. 
**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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TABLE  C.7  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  7 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.5877 

161 

.0037** 

OR 

PDF 

hyp 

1.2357 

161 

.0077 

log 

1.0163 

161 

.0063 

Pr(X>x) 

exp 

9.1461 

215 

.0425** 

OR 

1-CDF 

hyp 

13.0889 

215 

.0609 

log 

20.8564 

215 

.0970 

CLEAR 

Pr(X=x) 

exp 

.7761 

181 

.0043** 

OR 

PDF 

hyp 

1.1291 

181 

.0062 

log 

1.0165 

181 

.0056 

Pr(X>x) 

exp 

15.3357 

287 

.0534 

OR 

1-CDF 

hyp 

13.5228 

287 

.0471** 

log 

20.7489 

287 

.0723 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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TABLE  C.8  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  8 


CONDITION  CASE  MODEL  TOTAL  NUMBER  OF  G*  (mean 

ERROR  OBSERVATIONS  error) 


CLOUDY 


=x) 

Q 

exp 

.5543  ] 

IF 

hyp 

1.3901  ] 

log 

1.0121  ] 

10.3378 

15.7208 


CLEAR 


wv 


;\*«\V,Vy 


.0044** 


.0038** 


TABLE  C.9  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  9 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.4239 

159 

.0027** 

OR 

PDF 

hyp 

1.3984 

159 

.0088 

log 

1.0241 

159 

.0064 

Pr(X>x) 

exp 

6.9153 

193 

.0358** 

OR 

1-CDF 

hyp 

12.5797 

193 

.0652 

log 

19.0751 

193 

.0988 

CLEAR 

Pr(X=x) 

exp 

.5087 

112 

.0045** 

OR 

PDF 

hyp 

1.5275 

112 

.0136 

log 

1.0090 

112 

.0090 

Pr(X>x) 

exp 

3.6541 

159 

.0230** 

OR 

1-CDF 

hyp 

11.4825 

159 

.0722 

log 

15.5222 

159 

.0976 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.10  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  10 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G *  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.3393 

91 

.0037** 

OR 

PDF 

hyp 

1.6636 

91 

.0183 

log 

1.0156 

91 

.0112 

Pr(X>x) 

exp 

4.2427 

132 

.0321** 

OR 

1-CDF 

hyp 

9.0050 

132 

.0682 

log 

13.4479 

132 

.1019 

CLEAR 

Pr(X=x) 

exp 

.4223 

223 

.0019** 

OR 

PDF 

hyp 

1.4926 

223 

.0067 

log 

1.0215 

223 

.0046 

Pr(X>x) 

exp 

15.2218 

418 

.0364** 

OR 

1-CDF 

hyp 

29.4698 

418 

.0705 

log 

40.1674 

418 

.0961 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.ll  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  11 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.2036 

94 

.0022** 

OR 

PDF 

hyp 

1.5248 

94 

.0162 

log 

1.0343 

94 

.0110 

Pr(X>x) 

exp 

2.3883 

115 

.0208** 

OR 

1-CDF 

hyp 

11.8123 

115 

.1027 

log 

17.2083 

115 

.1496 

CLEAR 

Pr(X=x) 

exp 

.7210 

229 

.0031** 

OR 

PDF 

hyp 

1.2036 

229 

.0053 

log 

1.0305 

229 

.0045 

Pr(X>x) 

exp 

14.6087 

352 

.0415** 

OR 

1-CDF 

hyp 

26.5452 

352 

.0754 

log 

37.6494 

352 

.1070 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.12  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  12 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.2148 

83 

.0026** 

OR 

PDF 

hyp 

1.4919 

83 

.0180 

log 

1.0263 

83 

.0124 

Pr(X>x) 

exp 

.7901 

90 

.0088** 

OR 

1-CDF 

hyp 

7.7721 

90 

.0864 

log 

11.9415 

90 

.1327 

CLEAR 

Pr(X=x) 

exp 

.4609 

150 

.0031** 

OR 

PDF 

hyp 

1.3890 

150 

.0093 

log 

1.0212 

150 

.0068 

Pr(X>x) 

exp 

4.5985 

178 

.0258** 

OR 

1-CDF 

hyp 

14.6900 

178 

.0825 

log 

20.5540 

178 

.1155 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 


TABLE  C.13  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  13 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.1987 

45 

.0044** 

OR 

PDF 

hyp 

1.7347 

45 

.0385 

log 

1.0242 

45 

.0228 

Pr(X>x) 

exp 

1.5423 

50 

.0308** 

OR 

1-CDF 

hyp 

5.3337 

50 

.1067 

log 

8.2453 

50 

.1649 

CLEAR 

Pr(X=x) 

exp 

.1094 

42 

.0026** 

OR 

PDF 

hyp 

1.8956 

42 

.0451 

log 

1.0082 

42 

.0240 

Pr(X>x) 

exp 

.8236 

58 

.0142** 

OR 

1-CDF 

hyp 

4.5904 

58 

.0791 

log 

6.9515 

58 

.1199 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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TABLE  C.14  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  14 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.5340 

104 

.0051** 

OR 

PDF 

hyp 

1.6078 

104 

.0155 

log 

1.0092 

104 

.0097 

Pr(X>x) 

exp 

7.6293 

179 

.0426** 

OR 

1-CDF 

hyp 

7.8503 

179 

.0436 

log 

12.2218 

179 

.0683 

CLEAR 

Pr(X=x) 

exp 

.4471 

41 

.0109** 

OR 

PDF 

hyp 

log 

2.1281 

1.0003 

41 

41 

.0519 

.0244 

Pr(X>x) 

exp 

2.7194 

89 

.0306** 

OR 

1-CDF 

hyp 

3.1798 

89 

.0357 

log 

5.0265 

89 

.0555 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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TABLE  C.15  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  15 


*  Multiply  these  values  by  100  to  find  the  percent  error. 
**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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TABLE  C.16  MEAN  ERRORS  OF  MODELED  PROBABILITIES  FOR  IMAGE  16 


CONDITION 

CASE 

MODEL 

TOTAL 

NUMBER  OF 

G*  (mean 

ERROR 

OBSERVATIONS 

error) 

CLOUDY 

Pr(X=x) 

exp 

.2351 

48 

.0049** 

OR 

PDF 

hyp 

2.3885 

48 

.0498 

log 

1.0024 

48 

.0209 

Pr(X>x) 

exp 

1.3048 

66 

.0198** 

OR 

1-CDF 

hyp 

5.4727 

66 

.0829 

log 

7.4080 

66 

.1122 

CLEAR 

Pr(X=x) 

exp 

.5597 

134 

.0042** 

OR 

PDF 

hyp 

1.8827 

134 

.0141 

log 

1.0035 

134 

.0075 

Pr(X>x) 

exp 

11.8303 

254 

.0466 

OR 

1-CDF 

hyp 

10.2144 

254 

.0402** 

log 

14.0700 

254 

.0554 

*  Multiply  these  values  by  100  to  find  the  percent  error. 

**  These  models  performed  best  in  their  respective 
CONDITION/CASE  combinations. 
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APPENDIX  D 


EXTREMELY  HIGH  RESOLUTION  IMAGERY  FROM  THE  SPACE  SHUTTLE 
ORBITER: 

Imagery  of  the  cloudy  atmosphere  is  an  immensely  useful  tool  to  the  atmospheric  scien¬ 
tist.  The  space  shuttle  orbiter  is  a  largely  untapped  resource  of  extremely  high  resolution 
imagery  and  is  available  using  existing  technology. 

The  potential  surface  resolution  (also  called  resolving  power)  of  a  space-bome  imaging 


system,  R^,  can  be  estimated  by 


where  H  is  the  altitude  of  the  system,  F  is  the  lens  focal  length,  and  R ^  is  the  system 
resolution  (Jensen,  1968).  Scene  contrast,  image  motion,  and  atmospheric  conditions  can 
also  affect  surface  resolution,  but  equation  (D.l)  is  a  useful  measure  of  system  capabilities. 
In  the  case  of  the  space  shuttle  the  imaging  system  resolution,  R,y,  is  given  by 


(R„)2  (Rfy  ~(Ri)2  '  (Ru,)2  ’ 

where  Rf  is  the  resolution  of  the  particular  film,  Ri  is  resolution  of  the  lens,  and  R^,  is 
the  resolution  possible  through  an  intervening  window,  if  one  is  present  (Cox,  1974).  In 
virtually  every  case,  film  resolution  is  the  worst  of  the  three  and  can  be  used  make  a  rough 
estimate  of  the  resolution  of  the  entire  imaging  system. 

Resolution  of  films  and  windows  is  governed  by  the  materials  used  in  their  manufacture. 
Lens  resolution  depends  on  what  is  known  as  the  Rayleigh  criterion  as  well.  The  Rayleigh 
criterion  states  that  angular  resolution  is  directly  proportional  to  wavelength  and  inversely 
proportional  to  aperature  diameter.  Consequently,  maximum  spatial  resolution  (minimum 
angular  resolution)  is  achieved  at  short  wavelengths  and  large  aperatures. 
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Resolution  of  each  of  these  three  components  is  commonly  judged  by  the  accuracy  with 
which  they  can  reproduce  the  closely  spaced  parallel  lines  of  a  test  chart.  Film  resolutions 
were  listed  in  Table  2.2.  Shuttle  window  resolution  exceeds  700  lines  per  millimeter  and 
camera  lens  resolutions  each  exceed  800  lines  per  millimeter. 

Using  information  cited  here  and  in  Tables  2.1  and  2.2  of  this  work,  it  is  a  simple  matter 
to  calcualte  the  potential  surface  resolution  of  the  hand-held  and  large  format  camera 
systems.  Substitution  into  (D.l)  using  values  for  image  1  (a  hand-held  camera  image) 
results  in  a  potential  surface  resolution  of  .0104  kilometers  or  10  meters!  Substitution  into 
(D.l)  using  values  for  image  2  (a  large  format  camera  image)  results  in  a  potential  surface 
resolution  of  .0073  kilometers  or  7  meters! 

Considering  the  lower  limit  of  shuttle  orbital  operations  (about  185  kilometers)  and 
the  availability  of  high  resolution  films  and  telephoto  lenses,  even  better  potential  surface 
resolution  is  possible. 

The  narrow  field-of-view  of  long  focal  length  lenses  (e.g.,  19.5  degrees  for  a  105  millimeter 
lens  mounted  on  a  35  millimeter  camera)  need  not  be  an  overriding  concern.  A  potential 
imaging  system’s  components  can  be  varied  according  to 


where  FOV  is  field-of-view,  Wj  is  film  width,  and  F  is  the  lens  focal  length  (Jensen,  196S), 
to  achieve  the  desired  angular  coverage. 

Using  equations  (D.l)  and  (D.3),  shuttle-borne  imaging  systems  could  be  tailored  to 
almost  any  need.  (Imagery  need  not  be  restricted  to  that  collected  at  visible  wavelengths.) 


